arXiv:1501.00076v2 [math.CO] 21 Nov 2016 


ON THE NUMBER OF SIMILAR INSTANCES OF A 
PATTERN IN A FINITE SET 

BERNARDO ABREGO, SILVIA FERNANDEZ-MERCHANT, DANIEL J. KATZ, 
AND LEVON KOLESNIKOV 


Abstract. New bounds on the number of similar or directly simi¬ 
lar copies of a pattern within a finite subset of the line or the plane 
are proved. The number of equilateral triangles whose vertices all lie 
within an n-point subset of the plane is shown to be no more than 
L(4n — l)(n — 1)/18J. The number of fc-term arithmetic progressions 
that lie within an n-point subset of the line is shown to be at most 
(n ~ r)(n + r ~ k -\- l)/(2fc — 2), where r is the remainder when n is 
divided by k — 1. This upper bound is achieved when the n points 
themselves form an arithmetic progression, but for some values of k and 
n, it can also be achieved for other configurations of the n points, and a 
full classification of such optimal configurations is given. These results 
are achieved using a new general method based on ordering relations. 


1. Introduction 

Erdos and Purdy na i 2 ii 2 o] raised the question of finding the maximum 
number of equilateral triangles that can be determined by n points in the 
plane, where we say that a triangle is determined by a set V when the 
three vertices of the triangle lie in V. This problem is also mentioned in 
the compendia of unsolved problems in geometry by Croft, Falconer, and 
Guy png and by Brass, Moser, and Pach [9], and is discussed by Pach and 
Sharir in [27J. There are many other variations of this problem involving 
other patterns, constraints on the n-set in which instances of the pattern 
are sought, higher-dimensional ambient spaces, different definitions for what 
counts as an instance of a pattern, and different optimization objectives, 
e.g., EH El HI El El m [HI QEl EH EH EH m\- These problems trace their 
inspiration to Erdos’ question m about the maximum number of pairs of 
points at unit distance that can be determined by an n-subset of the plane, 
and other related questions, which have led to a rich literature (see [91 [24] 
for an overview). Apart from being a well-known and important question 
in discrete geometry, the problem of determining the number of instances of 
a pattern in an n-subset of the plane is relevant to the problem of pattern 
recognition in data from scanners, cameras, and telescopes [HEltToj. 

In all these problems, we have a universe, which is a set U , an equivalence 
relation ~ on the power set of U, and a pattern CP, which is an equivalence 
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class of If P € IP, we say that P is an instance of pattern IP. We call IP a 
^-pattern if all its instances are k- sets. For the rest of this paper, instances 
of the pattern IP are always assumed to be finite sets. For V C U, we let 


Sy(V) = \{PCV:PeT}\- 


We are concerned with finite subsets V of our universe U. For an integer n, 
we define 

iS'p(n) = max S'y(V’), 

I VY=n 

that is, the largest number of instances of IP that can be found in an n-subset 
of U. 

The results of this paper concern the case when U is the line R or the 
plane R 2 (identified with C), where ~ is the geometric relation of similarity 
or direct similarity)]] and where the patterns T are classes of finite subsets, 
such as arithmetic progressions in the line, or equilateral triangles in the 
plane. 

If our relation ~ is similarity or direct similarity, and the instances of 
our pattern IP have more than one point, and we are counting the number 
of instances of T in subsets of the plane, then it is not hard to show that 
Sy(n) is 0(n 2 ). Elekes and Erdos [16] proved a subquadratic lower bound 
for general patterns IP and a quadratic lower bound if there is an instance of 
y whose points all have algebraic numbers for their coordinates. Laczkovich 
and Ruzsa [23j later showed that Sy{n) = 0(n 2 ) if and only if the cross 
ratio of every quadruplet of distinct points in an instance of IP is algebraic. 
However, before this work there were no patterns T (with at least three 
points) for which the quadratic coefficient was known. 

Let £ be the pattern of the vertices of an equilateral triangle, that is, £ 
contains all 3-point subsets of the plane such that all three pairs of points 
within the set have the same distance. Then the results of Laczkovich and 
Ruzsa show that S^(n) = Q(n 2 ). Prior to this paper, the best known bounds 
[2] for S'g(n) were 


( 1 ) 




n 2 + 0(n 3,/2 ) < Sg(n) < 


(n — l) 2 
4 


so that 


0.1955 < - — < lim inf ^ < limsup 


S£(n) 


47T 







The lower bound is obtained from the points in the equilateral triangle lattice 
contained in a disk of a suitable radius. It is conjectured |2j Conjecture 1] 
that this construction is asymptotically optimal and that lim n _ >00 5g(n)/n 2 


^Two figures are said to be directly similar if one can be obtained from the other by a 
rotation and a translation. Reflections are not allowed. 
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1 /3 

exists and equals 3 — the lower endpoint of the interval in which the 
limits inferior and superior for Si(n)/n 2 are known to lie. 

The main result of this paper is an improved upper bound on S^{n). 

Theorem 1.1. If £ is the pattern of the vertices of equilateral triangles in 
R 2 , then S^iji) < [(4 n — 1 )(n — 1)/ 18J. 

Therefore lim supj^^ Sg (n) /n 2 <2/9. The proof is given in Section [ 8 ] 
and is based on an order-theoretic methodology, developed in Sections [2H4J 
for attacking pattern-counting problems. 

Our new methodology was first developed while exploring the simpler 
problem of counting instances of finite patterns (such as arithmetic progres¬ 
sions) in a finite subset of the line R, and was used to discover results that 
are presented in Sections [SHU After settling preliminaries in Section [5] we 
calculate the precise maximum number of /c-term arithmetic progressions 
that can occur in an n-point subset of the line in Section [ 6 ] The following 
is the main result of Section El 


Theorem 1.2. Let k > 1, and let he the pattern of k-term arithmetic 
progressions on R. If n £ N and r is the remainder when n is divided by 
k — 1, then SA k (n) = (n — r)(n + r — k + l)/(2 k — 2). 


For each k > 2, we completely classify all n-point subsets of the line 
that contain as many /c-term arithmetic progressions as possible. These 
optimal sets include n-term arithmetic progressions, but there are some 
other optimal sets: when k > 3 and k — 1 divides n, sets obtained by 
deleting the second or penultimate point from an arithmetic progression are 
also optimal (see Proposition 16.5[) . When k = 3, the variety of the optimal 
sets is considerably richer (see Proposition 16.3[) . This work complements the 
asymptotic approach of Elekes [15] , who studied the structure of n-subsets of 
the line that contain an asymptotically large number of /c-term arithmetic 
progressions, whereas we determine the precise maximum and those sets 
that achieve it. 

In Section [3 we show that our method can be used to obtain both upper 
and lower bounds on Sy(n) when 7 is a commensurable pattern in the 
line, that is, where the ratios of the distances between points in 7 are all 
rational. We count the number of directly similar copies of P = {0,1,3} in 
finite subsets of the line, and thereby show that if 7 is the pattern for P, 
then 


(2) 


I < lim inf < lim sup 



We then construct a family of n-point subsets V n of the line that has 
lim n _ ) . 0O Sy{y n )/n 2 = 3/16, which is the highest known value and, inter¬ 
estingly, lies strictly between the bounds of (] 2 ]h 

As noted, our order-theoretic method is a new tool for counting patterns in 
finite sets. It combines nicely with other techniques of discrete geometry, for 
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example, the proof of Theorem 11.11 combines our tools with the topological 
technique of finding three concurrent halving lines of a set (cf. [221 Lemma 
2], [2H . Lemma 3], and [141 Lemma 2]). Further applications will be explored 
in future works. 


2. Order and Decompositions 

A key theme in this work is that one can place an order relation on the 
universe U, and use it to create bounds on Sj>(n). So henceforth, we assume 
that our universe U has a total ordering relation A. If V C U, then we say 
that a point v £ V is the ith point of V with respect to ^ to mean that there 
are precisely i — 1 points w £ V with w -< v. If V, W C U, then we write 

V A W to mean that v -< w for every v £ V and w £ W. 

If V is a set and £ a positive integer, then an £-decomposition of V is 
a set of disjoint subsets of V such that (J j=1 Vj = V. (So 

a decomposition differs from a partition only inasmuch as the former may 
include empty subsets.) An ^-decomposition of a finite set V with n points is 
said to be balanced if each set in the decomposition has either [n /£J or \n/f \ 
points in it. An ^-decomposition V of V is called A -orderly if, whenever V, V' 
are distinct elements of V, then either V -< V' or V' -< V. If V is an orderly 
•^-decomposition of V, then we say that V £ V is the ith set in V to mean 
that there are precisely i — 1 sets W £ V with W -< V. 

If k > 2 and there is a ^-orderly {k — 1 ^decomposition V of V, and a k- 
subset P of V, then we say that P is of echelon j in V if the jth and (j + l)th 
points of P lie in the jth set of V. That is, if V = {Vi -<■■■-< I4_i} and 

P = {pi -< -< pk}, then P is of echelon j in V if pj,pj + ± £ Vj. It is a key 

fact that P must be of echelon j in V for at least one j. 

Lemma 2.1. Let k > 2 and let V be a set with a P-orderly (k — 1)- 
decomposition V. If P is a k-subset of V, then P is of echelon j in V 
for some j £ { 1 ,..., k — 1 }. 

Proof. By induction on k, with the k = 2 case trivial. For k > 2, suppose 

that P = {p\ -< -< pk} is not of echelon (fc— 1) in V = {hi -<■■■-< I4_i}. 

Then pk-i fL I4_i, and thus P \ {pk} is a (.k — l)-subset of V \ V*_i with 

^-orderly [k — 2 )-decomposition {Vi,..., Vfc_ 2 }, and hence there is some 
j € {1,..., k — 2} such that pj,pj + \ £ Vj by induction. □ 

Remark 2.2. It should be noted that Lemma 12.II is somewhat more detailed 
in its conclusions than the usual Pigeonhole Principle. For example, suppose 
that our universe U is M and A is the usual ordering < of real numbers. Let 

V = P = {pi < ... < pq}, and let V = {Vi,..., V 5 } be the orderly 5- 
decomposition of V shown in Figure [I] 

Since we have six points in five sets, the usual Pigeonhole Principle shows 
that there is at least one set in V containing at least two points of P; this is 
seen in V 3 , which contains P 2 and p%. But P = {p\ < ... < p$} (which is the 
same as V) is not of echelon i = 3 in V, because although V 3 contains two 
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V, F 2 F 3 F 4 V 5 


Figure 1. Illustration of Lemma 12.11 

points of P, it does not contain the particular points pi = p 3 and pi + \ = p 4 . 
On the other hand, Lemma [2. II asserts that there is some j E {1,... , 5} such 
that P is of echelon j in V, and we see P is indeed of echelon j = 4 since 
points pj = P 4 and Pj+\ = P 5 lie in V 4 . 

Of course, if k > 4, and V is a set with ^-orderly (k — l)-decomposition 
V, then a L-subset P of V may be of echelon j in V for more than one j. 

3. Reconstructibility 

Let IP be a fc-pattern in our universe U with order relation A. Suppose we 
are given some j E {1,..., k — 1} and two points u,v E U with u -< v. Any 
P E IP that has u and v respectively as its jth and (j + l)th points is called 
a reconstruction of IP with the prescribed points as its jth and (j + l)th 
points. 

If, for every j E {1,... ,k — 1} and u -< v E U, there is at least one 
reconstruction of IP with u and v as jth and (j + l)th points, then we say 
that y admits at least one reconstruction from A -consecutive points. If there 
is at most one reconstruction for every j E {1,..., k — 1} and u -< v E U , 
we say that IP admits at most one reconstruction from A -consecutive points. 
If both of these hold, then we say that IP admits a unique reconstruction 
from tL consecutive points , or is uniquely reconstructible from P-consecutive 
points. 

Example 3.1. Suppose that our pattern IP is the set of all triples that are 
vertices of isosceles right triangles, and suppose that our order relation R 
is lexicographic ordering of M 2 . (That is, (a, b) -< ( c,d ) if and only if either 
(i) a < b or (ii) a = b and c < d.) We see that a pair of points A and B 
may be the first and second points (under the relation P) of more than one 
instance of IP. For instance, if A and B are the points in Figure [2] below, 
there are three isosceles right triangles of which A and B are the leftmost 
two vertices, namely, AABCi, A ABC 2 , and AABC 3 . (There are also three 
other isosceles right triangles, AABC 4 , AABC 5 , and AABCq, with A and 
B as vertices, but A and B are not the leftmost two vertices in these.) If 
one looks at other pairs A and B, one can find out that IP always admits at 
least one reconstruction from ^-consecutive points, but often admits more 
than one reconstruction. 
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C 4 



Figure 2. Points A and B are vertices in six isosceles right 
triangles, three of which are reconstructions that make A and 
B the leftmost two vertices. 


Example 3.2. Suppose that our pattern IP is the set of all triples that are 
vertices of equilateral triangles, and suppose our order relation A is lex¬ 
icographic ordering of M 2 . We see that a pair of points A and B might 
not be the second and third points (under the relation tC) of any instance 
of CP. For instance, if A and B are the points in Figure [3] below, there is 
no equilateral triangle of which A and B are the rightmost two vertices. 
(There are two equilateral triangles, A ABC\ and A ABC 2 , with A and B 
as vertices, but A and B are not the rightmost two vertices in these.) If one 
looks at other pairs A and B, one can find out that CP never admits more 
than one reconstruction from ^-consecutive points: it sometimes admits no 
reconstruction, and sometimes admits one reconstruction. In Lemma 18.11 
we determine precisely when these two cases occur. 




A 



B 


C 2 


Figure 3. Points A and B are vertices in two equilateral 
triangles, but neither of these is a reconstruction that makes 
A and B the rightmost two vertices. 
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Example 3.3. Suppose that our pattern CP consists of all sets of points in M 
that are directly similar to {0,1,3,6}, and suppose that our order relation 
^ is the usual ordering of M. A pair of points A and B with A < B will 
always be the second and third points of precisely one instance of CP, namely, 
{(3A — B)/2, A, B, (5 B — 3A)/2}. For example, if A and B are —1 and 5, 
respectively, then the unique instance of CP of which A and B are the second 
and third points is {—4, —1, 5,14}. 


4. A General Upper Bound on Sy(n) 

Theorem 4.1. Let k > 1 and let CP be a k-pattern in U that admits at 
most one reconstruction from A-consecutive points. If n G N and r is the 
remainder when n is divided by k — 1, then Sy(n) < (n — r)(n + r — k + 
l)/(2*-2). 

Proof. Let V be an n-subset of U. and let V = {V\ -<•••-< i } be a A- 
orderly, balanced (k — 1 ^decomposition of V. By Lemma 1 2. 11 each P <ZV 
with P G CP is of echelon j in V for some j G {1,..., fc — 1}. Each subset of 
echelon j has its jth and (j + l)st elements in Vj, and since no two instances 
of CP may have the same jth and (j + l)th points, this means that there 
are at most (^) instances of CP that are of echelon j. Thus there are at 
most CV) instances of CP in V. If we write n = q(k — 1 ) + r, then 

our decomposition, being balanced, has \Vj\ = q + 1 for r values of j and 
\Vj\ = q for k — 1 — r values of j, so that 5y(U) < + (fc — 1 — r)(|) = 

(n — r)(n + r — k + l)/{2k — 2). □ 

The method of counting used in this proof provides a criterion for when 
this upper bound is achieved. 

Lemma 4.2. Let k > 1 and let IP be a k-pattern in U that admits at most 
one reconstruction from A -consecutive points. Let n G N and let r be the 
remainder when n is divided by k — 1. Let V be an n-subset of U with A- 
orderly, balanced (k — l)-decomposition V = {V\ ,..., 14_i}. Then S’y(U) = 
{n — r){n + r — k + 1 ) / (2 k — 2) if and only if for each j € ( 1 ,..., k — 1 } and 
every pair of distinct points v,w € V}, there exists a reconstruction P of CP 
with v and w as jth and (j + 1 )th points, and P is a subset ofV that is not 
of echelon i in V for any i j. 

Proof. Examining the proof of the preceding theorem, we see that if any 
of the reconstructions mentioned in the statement of this proposition did 
not exist, or did not lie in V, then (^) would be an overestimate of the 
number of instances of CP that are of echelon j. And if any instance were of 
two distinct echelons in V, then we would be counting it twice in Yl'jZ o (^) > 
thus making our bound an overestimate. □ 
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5. General Patterns in the Line 

In this section, our universe U is the line R, our equivalence relation ~ 
is direct similarity, and our pattern CP can be the equivalence class for any 
finite subset of R. Our order relation is the usual order relation < for R. 
We note that any pattern CP is uniquely reconstructible from <-consecutive 
points, so Theorem 14.11 applies, amounting to the following. 

Theorem 5.1. Let k > 1 and let CP be any k-pattern in R. If n € N and r 
is the remainder when n is divided by k — 1, then Sy(n) < (n — r)(n + r — 
k + l)/(2k - 2). 

This upper bound is tight in the sense that for each k > 1, when CP is the 
pattern of fc-term arithmetic progressions, then Sy(n) equals the bound, as 
we shall now show. 

6. Arithmetic Progressions in the Line 

Here the universe U is the line R and our pattern is Af ; , the class of k- 
terrn arithmetic progressions. Because Ak is invariant under reflection, the 
results are the same whether we make ~ direct similarity or similarity. This 
enables an exact computation of Sji k (n): we recall and prove Theorem 11.21 

Theorem 6.1. Let k > 1, and let Ak be the pattern of k-term arithmetic 
progressions on R. If n E N and r is the remainder when n is divided by 
k — 1 , then Sj\, k (n ) = (n — r)(n + r — k + l)/(2k — 2 ). 

Proof. This is an immediate consequence of Theorem 15. II above and Lemma 
16.21 below. □ 

Lemma 6.2. If k > 1 and n € N with r the remainder when n is divided by 
k — 1 , then the number of k-term arithmetic progressions in { 0 , 1 , ..., n — 1 } 
is (n — r) (n + r — k + 1 )/ ( 2 k — 2 ). 

Proof. For each positive integer s < n/(k — 1), our set {0,1,... ,n — 1} 
contains precisely n — s(k — 1 ) arithmetic progressions having k points with 
distance s between consecutive points. The result follows by adding these 
quantities for 1 < s < n/(k — 1 ). □ 

Now that we know the precise value of Sj\ k (n), we would like to completely 
classify the n-subsets V of R achieving S^ k (V) = Sj\ k ( n ). We call such n-sets 
optimal for fc-term arithmetic progressions. Lemma 16.21 above shows that 
n-terrn arithmetic progressions are always optimal for fc-term arithmetic 
progressions, but in many cases there are other optimal sets, and we now 
classify them (up to similarity). 

All sets are trivially optimal for 1- or 2-term arithmetic progressions, and 
any n-set with n < k is trivially optimal for /c-terrn arithmetic progressions. 
The barycenter of an arithmetic progression is the arithmetic mean of its 
points. Two progressions in Z are said to be concentric if they have the 
same barycenter, or nearly concentric if their barycenters differ by 1 . 
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Proposition 6.3. For n > 3, an n-subset of R is optimal for 3-term arith¬ 
metic progressions if and only if, up to similarity, it is the union E U O 
of two nonempty arithmetic progressions, where E consists of consecutive 
even integers and O consists of consecutive odd integers and E and O are 
concentric if n is odd, or nearly concentric if n is even. 

Remark 6.4. Before we embark upon the proof, we pause to note that the 
set EDO described here is an arithmetic progression when \E\ and \0\ are 
either equal, or one differs from the other by 1 . 

Proof of Proposition 1 6. S[ In view of the value of Sj \ 3 (n) from Theorem l 6 .ll 
we may use Lemma 14.21 as the criterion for optimality. Suppose that V is an 
n-subset of M and V = {Vi < V 2 } is a <-orderly, balanced 2-decomposition 
of V. Because arithmetic progressions are uniquely reconstructible from 
consecutive points, and since no 3-term progression may be both of echelon 1 
and of echelon 2 (since that would require it to have more than three points), 
Lemma 14.21 shows that V is optimal for 3-term arithmetic progressions if 
and only if for every j E {1,2} and every pair of distinct v,w E Vj, the 
reconstruction of .A 3 with v and w as jth and (j + l)th points is contained 
in V. 

First, suppose V has the form E U O as described in the statement of 
this proposition. If | E\ and \0\ are equal or differ by 1, then V is itself an 
arithmetic progression, and hence optimal by Lemma [ 6 .21 so we may assume 
that |FI| 7 ^ |0|. We may then view V as X U Y, where X is the larger of E 
and O, while Y contains the smaller of E and O, along with any points from 
the larger of E and O that lie between the points of the smaller of E and O. 
Thus X is an arithmetic progression with distance 2 between its consecutive 
points, while Y is an arithmetic progression with distance 1 between its 
consecutive points. We let V = [V\ < V 2 } be an <-orderly, balanced 2- 
decomposition of V. Then V induces <-orderly, balanced 2-deconrpositions 
of X and Y, which are X = {X n V 1} X n V 2 } and y = {X fl Vi, Y n V 2 }, 
respectively. Now suppose that v\ < v 2 are points in V}, and let V 3 be the 
point that makes {tq < v 2 < U 3 } a 3-term arithmetic progression. If v\ 
and v 2 are both in X\ (resp., Yf), then since X (resp., Y) is an arithmetic 
progression, and hence optimal for 3-term arithmetic progressions, we may 
apply Lemma 14.21 to conclude that V 3 E X (resp., Y), and hence V 3 E V. 
In the remaining case, where v\ E X\ and v 2 € Yf, we note that V 3 cannot 
be larger than the largest point in V and has the same parity as v\, hence 
V 3 € X C V. By the same argument, if v 2 < V 3 are two points in V 2 , then 
the point v\ that makes {iq < v 2 < U 3 } a 3-point arithmetic progression is 
always in V. Thus our set V satisfies the sufficient criteria of Lemma 14.21 
for optimality. 

Now suppose that V is an n-subset of M that is optimal for 3-term arith¬ 
metic progressions. We want to show that V has the form described in the 
statement of this proposition. First we deal with the case where n is odd, 
and without loss of generality, we may apply a similarity transformation 
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so that the middle point of V is 0, and the next point in V is 1. We let 
V = {Vi < V 2 } be an <-orderly, balanced 2-decomposition of V with the 
negative points in Vi, and the nonnegative points in V 2 . By Lemma 14.21 for 
every positive V 2 G V 2 , the 3-term arithmetic progression —V 2 < 0 < V 2 must 
lie in V, so the point —V 2 must lie in V\. Thus V has reflection symmetry 
about 0 . 

If u < v < w are consecutive points in V 2 , then by Lemma 14.21 the 3- 
term progression 2 v — w < v < w must lie in V, and so 2 v — w < u, that 
is, w — v > v — u. So the spacings between consecutive points of V 2 are 
nondecreasing as we proceed to the right. 

If 1 < u < v are consecutive points in V 2 , then by Lemma 14.21 the 3-term 
progression 2 — v < 1 < v lies in V, so that v — 2 lies in V by reflection 
symmetry, and so u > v — 2 . So the spacing between any two consecutive 
points of Vo is no greater than 2 . 

We also claim that all the points of V 2 are integers: we proceed by in¬ 
duction. The first two elements of V 2 are 0 and 1, so suppose that v G V 2 
with v > 1 , and we know that all elements of V 2 less than v are integers. 
By Lemma 14.21 the arithmetic progression 2 — v < 1 < v lies in V, so that 
\v — 21 lies in V by reflection symmetry, so \v — 2| must be an element of V 2 
less than v. hence an integer. So v is an integer. 

Thus spacings between any consecutive points of V 2 is always 1 or 2, and 
these spacings are nondecreasing as we proceed to the right. Since V has 
reflection symmetry about 0, we know that V has the form described in this 
proposition. This completes the proof when n is odd. 

Now suppose that n is even and V is an n-subset of M optimal for 3- 
term arithmetic progressions. Then the technical Lemma 16.61 whose proof 
is delayed to the end of this section, shows that an (n — l)-set V' obtained 
from V by removing either the leftmost or rightmost point is optimal for 
3-term arithmetic progressions. By applying similarity transformations, we 
may assume that the rightmost point r is removed and that V' has the 
form described in the statement of this proposition, with 0 being the middle 
point of V' . Then V' contains the point 1 since the set of odd points in 
V' is nonempty and centered at 0. Let r' be the rightmost point of V'. 
A <-orderly, balanced 2-deconrposition V = {V\ < V 2 } of V places all the 
nonpositive points into V\ and all the positive points into V 2 . 

In view of Lemma 14.21 the 3-term progression 2 — r < 1 < r lies in V, 
so 2 — r G V'. Thus r must be an integer, and by reflection symmetry in 
V\ we must have r — 2 < r', that is r G {r' + 1 ,r' + 2}. It is immediate 
that V' U {r' + 2} is of the form described in the proposition, so we are done 
if r = r' + 2. If r = r' + 1, then by Lemma 14.21 the 3-term progression 
r' — 1 < r' < r is contained in V, so then V' must consist of all consecutive 
integers from — r' to r', and then it is clear that V = V' U {r} is of the form 
described in the proposition. □ 
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Proposition 6.5. For n > k > 4, an n-subset V ofM.is optimal for k-term 
arithmetic progressions if and only if, up to similarity, V is an arithmetic 
progression or k — 1 divides n and V is an arithmetic progression with the 
second point deleted. 

Proof. We first analyze all optimal sets V for 4-term arithmetic progressions. 
Suppose that V is such an n-subset of R and n > 4. Let V = {V\ < V 2 < V 3 } 
with | V 2 1 > 2 be an <-orderly, balanced 3-decomposition of V. The technical 
Lemma 16.71 below shows that if we remove V 3 from V, then we are left with 
an optimal set for 3-term arithmetic progressions. Since V\ U V 2 is optimal 
for 3-term arithmetic progressions, Proposition 16.31 tells us that without loss 
of generality, we may take V\ U V 2 to be the union of a set E of consecutive 
even integers and a set O of consecutive odd integers, which either have a 
common center c (if \E U 0\ is odd) or E and O have distinct centers d and 
d + 1 in some order. 

We now prove that the distance between the leftmost two points of V 2 is 
1. Since V\ U V 2 = E U O and E and O are nonempty, V\ U V 2 contains a 
set of three consecutive integers. The integers are c — 1, c, c + 1 if \E U 0\ 
is odd or one of d — 1, d, d + 1 or d, d + 1, d + 2 if \E U 0\ is even. Then 
the three consecutive points are either the two rightmost of V\ and the one 
leftmost of V 2 or the one rightmost of V\ and the two leftmost of V 2 . The 
first case implies the second one, since if the rightmost two points of V\ 
have a spacing 1, then the leftmost two points of V 2 will complete a 4-term 
arithmetic progression with the former two points. So we know that distance 
between the leftmost two points of V 2 is 1 . 

By Lemma 16.71 the union of V 2 and V 3 is optimal for 3-term arithmetic 
progressions. Now since the leftmost two points of V 2 U V 3 are at distance 1, 
it follows from Proposition 16.31 that V 2 U V 3 is either an arithmetic progres¬ 
sion, or an arithmetic progression with the second-to-last point removed. 
Similarly, V\ U V 2 is either an arithmetic progression or an arithmetic pro¬ 
gression with the second point removed. By technical Lemma 16.81 below, 
V is either an arithmetic progression or else an arithmetic progression with 
either the second point or the second-to-last point (but not both) removed, 
and these latter two cases can only occur if 3 | n. Conversely, Lemma 16.81 
shows that sets V with these forms really are optimal for 4-term arithmetic 
progressions. 

Now we analyze all optimal sets for /c-terrn arithmetic progressions where 
k > 5. For n > k, let V = {V\ < V 2 < ■ ■ ■ < 14- 1 } be an <-orderly, balanced 
{k — l)-decomposition of a n-subset V of R, where we insist | V 2 1 >2. Then 
V' = V\ U ... U Vk -2 and V" = V 2 U ... U Vk-\ are optimal for (k — l)-term 
arithmetic progressions by Lemma 16.71 Hence, without loss of generality 
V' is either an arithmetic progression of consecutive integers, or what one 
obtains by removing either the second or second-to-last point (but not both) 
from such an arithmetic progression. If the rightmost two points of I4_2 
had spacing 2, then V" would begin with two points at spacing 1, then later 
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would have a spacing of 2 between consecutive points, neither of which would 
be endpoints, and this would contradict the optimality of V" for (k— l)-term 
arithmetic progressions. So any spacing of 2 between consecutive points in 
V' must be between the leftmost two points. And by a similar argument with 
V", we see that all consecutive points of V" have spacing 1 , except possibly 
the two rightmost, which can have spacing 2. Thus all consecutive points of 
V must have spacing 1, except for possible spacings of 2 between the leftmost 
pair and the rightmost pair. By Lemma 16.81 V is either an arithmetic 
progression or else an arithmetic progression with either the second point or 
the second-to-last point (but not both) removed, and these latter two cases 
can only occur if k — 1 | n) . Conversely, Lemma 16.81 shows that sets V with 
these forms really are optimal for /c-term arithmetic progressions. □ 

We close with the technical lemmata used to prove the two propositions 
above. 

Lemma 6 . 6 . Let V be a n-subset o/R that is optimal for 3-term arithmetic 
progressions. Then either the (n — 1 )-set obtained by removing the leftmost 
point ofV or the (n — 1 )-set obtained by removing the rightmost point ofV 
is optimal for 3-term arithmetic progressions. 

Proof. Let V = {V\ < V 2 } be a <-orderly, balanced 2-decomposition of V, 
with \ V\ \ > | V 2 1 - By Lemma [4.21 we see that any pair of points in V\ (resp., 
V 2 ) are the leftmost (resp., rightmost) two points of a 3-term progression in 
V. 

Suppose that the leftmost point £ of V is such that there is no 3-term 
progression £ < u < v with u. v G V 2 . Then let V' = V \ {£} with <-orderly, 
balanced 2 -deconrposition V = {Vf \ {£}, V 2 }, and we see that any pair of 
points in V\ \ {£} (resp., V 2 ) are the leftmost (resp., rightmost) two points 
of a 3-term progression in V', so V' is optimal by Lemma 14.21 

Now suppose that the leftmost point l of V is such that £ < m < r is a 
3-term progression with m,r € V 2 . Then m and r must respectively be the 
leftmost and rightmost points of V 2 , for otherwise Lemma 14.21 would dictate 
that the leftmost and rightmost points of V 2 would be a part of a 3-term 
progression in V, and that progression would need to involve a point to the 
left of £, which is absurd. 

We set V' = V \ {r}, and claim that it is optimal for 3-term arithmetic 
progressions. For any u < v G Li, we claim that the point w that makes 
u < v < w a 3-term arithmetic progression is in V'. For by Lemma 14.21 
w € V, and w = 2v — u, with v < m and u > £, so that w < 2m — £ = r 
(since £ < m < r is an arithmetic progression). 

If n is even, then set V = {Vf, V 2 \ {r}}, which is a <-orderly, balanced 2 - 
decomposition of V', and we see that any pair of points in V\ (resp., V^\{r}) 
are the leftmost (resp., rightmost) two points of a 3-term progression in V', 
so V' is optimal by Lemma 14.21 

If n is odd, let y be the rightmost point of V\, and set W = {W\ < W 2 } 
with W\ = V\ \ { y } and IL 2 = V 2 U {y}, which is a <-orderly, balanced 
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2-decomposition of V with IW 2 I > \W\\. Then set W = {W \, W 2 \ {r’}}, 
which is a <-orderly, balanced 2-decomposition of V' = V \ {r}. By Lemma 
a any pair v < w of points in W 2 form the rightmost two points of a 3- 
term arithmetic progression in V, so any pair v < w of points in W 2 \ {r} 
for the rightmost two points of a 3-term arithmetic progression in V'. By 
the paragraph before the previous one, we know that every pair of points 
in W\ = V\ \ {y} form the leftmost two points of a 3-term arithmetic 
progression in V'. Thus Lemma 14.21 shows that V' is optimal for 3-term 
arithmetic progressions. □ 

Lemma 6.7. IfV is a finite subset o/M that is optimal for k-term arithmetic 
progressions, and V = [V\ < ■ ■ ■ < 14_i} is a <-orderly, balanced (k — 1)- 
decomposition of V, then both U jZ\Vj and Vj are optimal for (k — 1 )- 
term arithmetic progressions. 

Proof. Let V' = ki k jZ( with <-orderly, balanced ( k — 2)-decomposition V = 
{Vi < ■■■ < 14 _ 2 }. By Lemma H~2l for a pair of points p and q in the subset 
Vj of V ', there is a L-term arithmetic progression P in V that contains these 
points and is only of echelon i in V for i = j. Note that P' = P \ (Pn V^-i) 
is an arithmetic progression of some length that is of echelon j in V. If P' 
has fewer than k — 1 points, then it must be true that more than one point 
of P is in I4_i, which implies that P is both of echelon k — 1 and echelon 
j in V, contradicting Lemma 14.21 Hence P' is an arithmetic progression 
of at least k — 1 terms contained in V'. By Lemma 14.21 V' is optimal for 
k — 1-term arithmetic progressions. By the same reasoning, is also 

optimal for ( k — l)-term arithmetic progressions. □ 

Lemma 6.8. Let n > k > 4. Let V be an n-subset of R that is either 
an arithmetic progression, or else an arithmetic progression with the second 
point p or the second-to-last point q (or both) removed. Then V is optimal 
for k-term arithmetic progressions if and only if (i) V is an arithmetic pro¬ 
gression or (ii) k— 1 divides n and V is an arithmetic progression with only 
one of p or q removed. 

Proof. We can assume, without loss of generality, that V C Z, that the first 
element of V is 0, and the smallest spacing between any two consecutive 
elements of V is 1. Write n = (k — 1 )q + r with 0 < r < k — 1. 

We already know from Theorem 16.11 and Lemma 16.21 that if V is an arith¬ 
metic progression, then it is optimal for /c-terrn arithmetic progressions. 

Now we assume that V is obtained from an arithmetic progression by 
removing only the second point, so that V = {0, 2,3,..., n}. Assume V is 
optimal for fc-term arithmetic progressions. 

If 0 < r < k — 1, let V = {V± < ■ ■ ■ < Vk-i) be a <-orderly, balanced 
(k — ^-decomposition of V, such that V\ = {0, 2,3,... , q + 1}. If we consider 
the fc-term arithmetic progression beginning with 0 and q + 1, its rightmost 
point is (k — 1 )(q + 1) = (k — 1 )q + (k — 1) > (k — 1 )q + r = n, which must 
be contained in V by Lemma 14.21 which is absurd. 
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On the other hand, if r = 0, then it is not difficult to use Lemma 14.21 to 
show that V = {0, 2,3,..., (A: — l)g} is in fact optimal for fc-term arithmetic 
progressions. By symmetry, we have also covered the cases where V is 
obtained from an arithmetic progression by only removing the second-to- 
last point. 

Now we assume that v is obtained from an arithmetic progression by 
removing both the second point and the second-to-last point, so that V = 
{0, 2 ,3,... , n — 2 , n — 1, n + 1}. Assume V is optimal for fc-term arithmetic 
progressions. Recall that we are writing n = (k— 1 )q + r with 0 < r < k — 1. 

If r = 0, let V = {V\ < ■■■ < I4_i} be a <-orderly, balanced (k — 1)- 
decomposition of V, in which V\ = {0, 2,3,... ,q}. If we consider the fc-term 
arithmetic progression beginning with 0 and q, then its rightmost point is 
{k — 1 )q = n, so Lemma 14.21 savs that n must lie in V, which is absurd. 

If 0 < r < k — 1, let V = {Vi < • • • < I4-i} be a <-orderly, balanced 
(k — ^-decomposition of V, such that V\ = {0,2,3,..., q} and V 2 = {q + 
1, q + 2,..., 2q + 1}. If we consider the fc-term arithmetic progression whose 
second and third points are q + 1 and 2 q + 1, then the first point must be 1, 
so Lemma 14.21 savs that 1 must lie in V, which is absurd. □ 


7. Commensurable Patterns in the Line 


Here the universe U is the line K, and our equivalence relation ~ is direct 
similarity. A commensurable pattern T in M is one such that for P € 7, all 
the distances between pairs in P are commensurable, that is, are related 
by rational ratios. Equivalently, IP is commensurable if it contains some 
P C Z. Or yet again, IP is commensurable if its instances are subsets of 
arithmetic progressions. Indeed if P E IP, there is a unique arithmetic 
progression A of minimum cardinality such that P C A. We call this A 
the enveloping arithmetic progression for P, and of course the set of all 
enveloping arithmetic progressions of elements of T is itself a pattern, called 
the enveloping pattern for CP. For a positive l, we let An be the pattern 
consisting of Aterrri arithmetic progressions. 


Theorem 7.1. Let IP be a commensurable k-pattern on M, and suppose that 
At is the enveloping pattern for CP. If n E N and r and s are respectively the 
remainders when n is divided by k — 1 and I — 1 , then 


(n 


s)(n + s - 1 + 1 ) . . [n- r){n + r — k + 1 ) 

21 — 2 - j - 2k-2 


Proof. The right side of the inequality follows directly from Lemma 14.21 
which gives an upper bound on the maximum number of instances of a 
/c-pattern CP in an n-subset of R. 

Now let A = {p < ■ ■ ■ < q} and A' = {p' < ■ ■ ■ < q'} be distinct I- 
term arithmetic progressions in R. Then they are enveloping arithmetic 
progressions for distinct instances of the commensurable pattern CP: P = 
{p < ■ ■ ■ < q} and P' = {p' < ■ ■ ■ < q'}. We see that in a given set, there are 
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at least as many instances of IP as there are £-term arithmetic progressions, 
so that Sj[ e (n) = (n — s)(n + s — I + l)/( 2 £ — 2 ) < Sj>(n). □ 

We now explore the commensurable pattern CP containing {0,1,3}. The 
enveloping pattern is A±. We know from Theorem 17.11 that Sj\ A {n) < 
Sy{n) < S A3 (n). Note from Theorem 16.11 that \mx n ^ ca S AA {n)/n 2 = 1/6 
and lirrin^oo S A 3 (n)/n 2 = 1/4. 

We now construct a family of sets V n where each V n is a set with n points 
containing Sy(V n ) = (3 n 2 — 8n)/16 directly similar copies of {0,1, 3}. This 
makes lim n ^. 00 Sy(V n )/n 2 = 3/16, which is strictly between the two limits 
we computed in the previous paragraph using the lower and upper bounds 
on Sy(V n ) furnished by Theorem 17. II We now describe our construction V n , 
and then prove our claim that Sy(V n ) = (3n 2 — 8n)/16. 

Construction 7.2. If n = 96 k for any positive integer k, we let V n be the 
union of the following four sets: 

M 0 = 6 Zn [0,108fc], 

Ml = (1 + 6 Z) n [72 k + 1,144fc - 5], 

M 3 = (3 + 6 Z) n [3,324A: - 9], and 
M 5 = (5 + 6 Z) n [72 k - 1,144fc - 7], 

or equivalently 

Mo = ( 6 a : 0 < a < 18A:}, 

Mi = ( 6 a + 1 : 12 k <a< 24 k - 1}, 

M 3 = ( 6 a + 3 : 0 < a < 54 k — 2}, and 
M 5 = ( 6 a - 1 : 12 k <a< 24 k - 1}. 

We show the smallest example of this construction in Figured! with points 
drawn as vertical strokes to make them easier to discern. 


Figure 4. Construction P with k = 1 (points drawn as 
vertical strokes) 


Proposition 7.3. If IP is the pattern containing {0,1,3}, and V n is the 
n-point set V n described in Construction 7. S\ then Sy(V n ) = (3n 2 — 8n)/16. 


Proof. To compute Sy(V n ) we first note that by choosing the congruence 
classes modulo 6 of the first two points of any P e IP, the class of the third 
point of P is fixed. For example, if we choose the first two points congruent 
to 5 (mod 6 ), then the third point must also be of the same class. Hence 
the 4x4 cases for the respective congruence classes of the three points 
are (0,0,0), (0,1,3), (0,3,3), (0,5,3), (1,0,4), (1,1,1), (1,3,1), (1,5,1), 
(3,0,0), (3,1,3), (3,3,3), (3,5,3), (5,0,2), (5,1,5), (5,3,5), and (5,5,5). 
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Henceforth we omit (1,0,4) and (5,0,2), as none of the points in V n are 
congruent to 4 or to 2 (mod 6). 

To compute the number of instances {p\ < P2 < P3} of 7 in V n with 
(pi,P2,P3) = (a,b,c) (mod 6) for a given triple (a, 6 , c), one counts lattice 
points in regions of Z 2 defined by a system of inequalities. For example, if 
n = 96 k and (a, 6, c) = (0,5, 3), we could represent each instance {p\ < P2 < 
Pz} of 7 with 


pi = 6x 

P2 = Gy - 1 

P3 = 3^2 - 2/1 1 = 6(3x - 2 y - 1) + 3, 
subject to the inequalities 


x <y, 

0 < x < 18/c, 

12 k < y < 24k — 1, and 
0 < 3y - 2x - 1 < 54/c - 2. 

The first inequality makes sure that p\ < p 2 , and the remaining three 
inequalities make sure that p \, P 2 , and p% are in the ranges prescribed for 
their respective congruence classes, as described in Construction 17.21 

In this way we calculate the following number of instances {p\ < P 2 < P 3 } 
of 7 in V n . The results for every instance with the corresponding triple 
(pi,P 2 ,P 3 ) (mod 6) are described in the following table: 


number of instances 

(Pi,P2,P3) (mod 6) 

54/c 2 - 3 k 

(0,0,0) 

171 k 2 + 6 k 

(0,1,3) 

270 k 2 + 9 k 

(0,3,3) 

171 k 2 + 3k-l 

(0,5,3) 

24 k 2 - 6 k 

(1,1,1) 

24k 2 + 2k 

(1,3,1) 

24 k 2 - 2k 

(1,5,1) 

54 k 2 + 3 k 

(3,0,0) 

189 k 2 - 6 k 

(3,1,3) 

486 k 2 - 45 k + 1 

(3,3,3) 

189 k 2 - 3k 

(3,5,3) 

24 k 2 + 2k 

(5,1,5) 

24 k 2 - 2k 

(5,3,5) 

24 k 2 - 6 k 

(5,5,5) 


If we add up all the counts, we get 1728A: 2 —48fc, or equivalently (3n 2 —8n)/16 
for the the total number of instances of 7 in V n . □ 
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8. Equilateral Triangles in the Plane 

In this section we explore S'g(n), where U is the Euclidean plane (identified 
with C), V is a finite subset, and £ is the pattern of vertices of an equilateral 
triangle. Our order relation on C is lexicographic ordering: y < z means 
that either (i) Re(y) < Re(^) or (ii) Re(y) = Re(z) and Im(y) < Im(z). This 
order relation respects addition, so if y -< y' and z z', then y + z -< y' + z'. 

If u and v are distinct points in C, they are vertices of only two equilateral 
triangles, and if we let w and w' be the respective third vertices of these two 
triangles, then u, w, v, and w' are vertices of a parallelogram with center 
(u + v)/2 = (w + w')/ 2, as shown in Figure [5] below. 


u 



Figure 5. Points u and v are vertices of two equilateral 
triangles, A uvw and A uvw', which together form a parallel¬ 
ogram with center at (u + v)/2 = (w + w')/2. 

Then the compatibility of our order relation with addition shows that £ 
admits at most one reconstruction from any pair of consecutive points, for 
neither u,v -< w, w' nor u,v w, w' is consistent with {u+v)/2 = (w+u/)/2. 

Since £ admits at most one reconstruction from any pair of consecutive 
points, Theorem 14.11 immediately tells us that Si{n) < n(n — 2)/4 if n is 
even and S'g(n) < (n — l) 2 /4 if n is odd, that is, S'g(n) < [(n — 1) 2 /4J in 
any case. This observation recovers upper bound on Sg(n) of [21 Theorem 
1 ] (given here in ([I])), which was previously obtained using a much more 
sophisticated geometric proof. 

We shall get an improved upper bound on Sg(n) by noting that sometimes 
£ admits no reconstruction from a pair of consecutive points. To see this, we 
define the usual principal value of the argument function Arg: C* —>• (— 7 r, tt\. 

Lemma 8.1. Let u,v be distinct points in C. Then these are the first two 
vertices (under fi.) of precisely one equilateral triangle and last two vertices of 
precisely one equilateral triangle ifAvg(v — u) E (— 57 t/ 6 , — 7 r/ 6 ]U( 7 r/ 6 , 57 t/ 6 ]. 
Otherwise, they are neither the first two nor the last two vertices of any 
equilateral triangle. 
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One can visualize the result of Lemma 18.11 there are always two equilat¬ 
eral triangles with vertices at u and v, but as we vary Arg(u — u) in Figure 
El below, we can see which arguments make u and v the first and second 
points (under our lexicographic ordering) of one of these equilateral trian¬ 
gles. Equilateral triangles for which u and v are the first and second points 
are drawn in black, while the rest are drawn in grey. 


v *. 



Arg (v-u) = -j 2 Arg(u-u) =-§ Arg (v - u) =-^ 


Figure 6 . Equilateral triangles with vertices at u and v: in 
solid black are those that make u and v the least two vertices 
under lexicographical ordering A. 

Proof of Lemma ES ' Let ^6 = e m ^ . Then u and v are vertices of only two 
equilateral triangles, with the third point being either (i) w = £ 6 u + Cqv or 
(ii) w' = C 6 u + (ev. 

Note that u,v -< w requires Re(ic) > Re(it) (with Im(u;) > Re(u) in case 
of equality) and R e(w) > Re(u) (with Im(ic) > Irn(u) in case of equality). 
Since C 6 + (e = 1, this is equivalent to Re(£e(^ — u)) > 0 and Re (—~ 
u)) > 0, with the corresponding imaginary parts strictly positive in cases of 
equality. These conditions are fulfilled if and only if Arg (v—u) € (vr/ 6 , 57 r/ 6 ]. 
One can similarly show that u, v >- w' under the same conditions, and that 
u, v -< w' if and only if Arg(u — u) € (— 57 t/ 6 , — 7 t/ 6 ], with u, v w under 
the same conditions. □ 

Let A = (—57t/6, — 7t/6] U (vt/6, 57t/6], the set of values of Arg(u — u ) that 
admit a reconstruction of £ with u and v as consecutive points. To get a good 
upper bound for S^(V) in an n-set V, we rotate V in such a way that the 
first \n/2 \ points of V (with respect to ordering P) have many pairs (u, v ) of 
points with Arg(u — u) $ A, and the last |_ n /2j points of V have many pairs 
(it, v) of points with Arg(it — v) 0 A. Thus when we decompose our set V 
into a balanced ^-orderly 2-decomposition V = {V \, V 2 }, many of the pairs 



ON THE NUMBER OF SIMILAR INSTANCES OF A PATTERN IN A FINITE SET 19 


in V\ admit no reconstruction of which they are the first two points, and 
likewise may of the pairs in V 2 admit no reconstruction of which they are the 
last two points, so that we may deduct the counts of these “unproductive” 
pairs from the standard upper bound of |_(n — 1) 2 /4J implied by Theorem 
m This requires some care, for as we rotate, points change from being 
among the first [~n/2] points of V to being among the last [n/2\ points of V 
and vice-versa, that is, our decomposition V = {Vi, V 2 } must change (points 
migrate between V\ and V 2 ) as we rotate V. We now indicate roughly how 
we overcome this difficulty, with technical details given later. We eventually 
shall show that, up to translation and rotation, we may assume that (i) 
approximately half of the points in V lie on each side of the y-axis, and (ii) 
approximately half of the points in V lie on each side of the line y = x/y/3, 
and (iii) approximately half of the points in V lie on each side of the line 
y = —x/y/3. These three lines cut the plane into six compartments, and we 
call pairs of points lying within the same compartment intracompartmental 
pairs. We then rotate this picture about the origin by angles of 0, 2n/3, and 
4-7t/3. Each intracompartmental pair (u,v) will have Arg (y — u) € A for two 
of these rotations, and Arg(v — u) 0 A for one of these rotations. Thus we 
may choose a rotation such that at least one-third of the intracompartmental 
pairs u, v have Arg (v—u) fL A. Sometimes points lie on the boundaries: such 
technical issues can be handled naturally using the formalisms that follow. 

We define a direction ( to be complex number of unit modulus, so the set 
of directions is the complex unit circle. A directed line with direction £ is 
line with parameterization t ^ Qt + rj for some ry € C, where the parameter t 
ranges over R. Every directed line with direction £ may be uniquely written 
as L(t ) = (t. + s(—i(), where s is a real scalar uniquely determined by the 
line L ; we call this the canonical form of the directed line L. Figure [7] shows 
the relation between the directions £ and — if, and the parameterization of 
the directed line L{t). Note that |s| is the distance from L to the origin, and 
the sign of s tells us which side of L the origin lies on. We may thus identify 
the set of directed lines with the Cartesian product of the unit circle (for 
C) and the real line (for s), and give the directed lines the topology of this 
product space. 


For each direction (, we define a total ordering relation on C with 
y z if and only if ify A ifz. where A is our usual lexicographic ordering 
on C. Note that Aj is the same as A, and for any direction £, we have 
y z if and only if y z. If L(t) is a directed line with direction £, 
then we say z € C lies to the left (resp., to the right ) of L(t) if 2 -<£ L(t) for 
all t € R (resp., z L(t ) for all t € R). If y,z & C are on the line L(t), 
say y = L(t y ) and 2 = L(t z ), then we say that y is below (resp., above ) z 
on L if t y < t z (resp., t y > t z ). Note that these concepts of left and right, 
below and above, all correspond to the usual notions when L(t) = it is the 
positively directed y-axis, or indeed if L is any translate thereof. 
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Im L 



Figure 7. The directed line L(t) = (t + s(— if) (with con¬ 
stants s E R and £ € C with |(| = 1) has direction ( and 
is distance |s| from the origin. In this particular instance, 
s < 0 , which means that the origin is to the right of a viewer 
standing on the line and facing in its direction. 


If ( is a direction and V = {ui -<£ • • • v n } is a finite set, then the 
(-median ofV is v^ n+1 y 2 if n is °<id, and is (v n / 2 + v n /2+i)/2 if n is even. If 
( is a direction and V is a finite set of points, then the (- halving line of V 
is the directed line Ly^ with direction ( that passes through the ("-median 
of V. The line is called a halving line because \n/2\ of the points of V are 
(with respect to less than the median, and so lie to the left of the line or 
below the (-median of V on the directed line, while |_ri/2j of the points of V 
are greater than the median, and so lie to the right of the line or above the 
median on the directed line. Our definition of halving line makes a unique 
halving line for V in each direction, and is identical to the definition used 
by Erickson, Hurtado, and Morin [21] . If c is the (-median of V, we call any 
point z with z rif c formally to the left of Ly^ and any point z with z)~^c 
formally to the right of Ly^. Thus \n/ 2 ] of the points of V lie formally to 
the left of Ly^ and [n/2\ of the points of V lie formally to the right of Ly(- 
With the topology for the space of directed lines introduced above, we 
claim that Lv.c, is continuous in (. Indeed, Ly^ evolves by rotating about 
a particular median for a segment of values of (, changing from median v 
to w precisely for the value of ( that makes v and w both lie on the line 
Ly Note that the (-median and the (—()-median of V are the same, so 
that Ly } £ and Ly-^ are the same line, but with opposite directions. These 
considerations lead to the following useful observation, first proved in this 
precise form in eh Lemma 3] (see also |22| Lemma 2] and ( 141 Lemma 2]). 
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Lemma 8.2. Let V be a finite subset of C. Then there is some direction 
C such that the halving lines of V in directions (, and e 47n / 3 C are 

concurrent. 

Proof. Let u> = e 2?r F 3 . If the point Ly^ LI L Vw i is on or to the left (resp., 
right) of the directed line Ly, 1, then the same point Ly fl L v _ u i must be 
on or to the right (resp., left) of the same but oppositely directed line Ly- i. 
As C traverses the unit circle, the lines Ly^, Ly )UJ g, and L v ji^ evolve contin¬ 
uously, and so the intersection point Ly^CtLy^^ also evolves continuously. 
Thus there must be some Q such that Ly^ fl crosses Ly^. □ 

Now we are ready to recall and prove our improved bound fTheorem ll.111 
on S £ (n). 

Theorem 8.3. If £ is the pattern of the vertices of equilateral triangles in 
M 2 , then 5g(n) < |_(4n — 1 )(n — l)/18j. 

Proof. Let V be an n-subset of M 2 (identified with C). Obtain three concur¬ 
rent halving lines of V, say L, M, and N, as described in Lemma [8.21 where 
the directions of M and N are obtained from that of L by rotating by 2ir/3 
and 47r/3, respectively. This means that a point can only be formally left (or 
formally right) of all three lines if it lies on all three. So we may decompose 
V into six or seven disjoint classes of points (called compartments ), where 
a point is classified according to its position (formally left or right) relative 
to the three lines. 


Position Relative to Line 


Compartment 

L 

M 

N 


formally left 

formally right 

formally left 

f 2 

formally left 

formally right 

formally right 

f 3 

formally left 

formally left 

formally right 

f 4 

formally right 

formally left 

formally right 

f 5 

formally right 

formally left 

formally left 

f 6 

formally right 

formally right 

formally left 


on on on 

(if this point is not in V\ U • • • U V^) 


Note that V7 is nonempty if and only if a point of V happens to lie on 
the intersection of the three lines and if said point happens to be formally 
left of all three lines or formally right of all three lines. We depict our 
compartments in Figure [H 

Let A = (—57t/6, — 7t/6] U (vt/6, 57t/6], the set of values of Arg(u — u) that 
(by Lemma 18.11) admit a reconstruction of £ with u and v as consecutive 
points. We call pairs of points in U') =1 Vj x Vj intracompartmental pairs. 
We can rotate V and the directed lines L, M, and N so that one of them is 
t i->- it, the positively directed y-axis. Each intracompartmental pair (u, v ) 
will have Arg(u — it) € A for two of these rotations, and Arg(u — u) ^ A 
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L 



Figure 8 . The seven possible compartments, where V 7 can 
only contain the intersection point of the three lines 


for one of these rotations. Thus we may choose a rotation such that at 
least one-third of the intracompartmental pairs u, v have Arg(u — u) 0 A. 
Without loss of generality, let us say that our chosen rotation makes the line 
L the positively directed y-axis. 

Every instance {v± -< V 2 -< U3} of £ in V either has v\ and V 2 formally 
to the left of L or has V 2 and V 3 formally to the right of L. Furthermore, 
Lemma f8. 1 1 shows that in the former case Arg(u2 — v±) E A and V 3 is uniquely 
determined by v\ and V2, while in the latter case Arg(u3 — V2) E A and v\ is 
uniquely determined by V 2 and V 3 . Since L is a halving line, there are \n/ 2] 
points formally to the left of L and |_ n /2j points formally to the right of L. 
So we count the number of pairs (■ u,v ) in V with u and v formally to the 
same side of L, and deduct the number of intracompartmental pairs that we 
deliberately rotated to prevent them from admitting a reconstruction of £ 
to obtain 


(3) 


S £ (tO < 



Now ^j = i \ Vj \ is either n or n—1 (depending on whether V7 is empty or not), 
and by convexity, the sum of binomial coefficients in dH) is minimized when 
the various values \ Vj\ are as close to equal as possible. So if n—1 = 6q+2r+s 
with q E Z, r E {0,1,2} and s E {0,1}, then 



> ( 2 r + s ) 



+ (6 — 2r 


= q(3q + 2r + s — 3). 



(4) 
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In this case, we see that the first two terms of (J3]) become 



= (3 q + r)(3q + r + s), 

and substituting ([4J and © into (j3J), we obtain 

s lyj < 8g(g + 2r + s) + 3 q + 3 r(r + s) 

3 

which works out to 


Se(V) < 


2 n 2 _ 5 _ _ 1 

9 1 IS" 1 3 

2 re 2 _± n+ ± 
9 n 18 "" ^ 18 

2 re 2_5_ n _ I 

9'‘ 18 n 6 

2 n 2 _ A n _ 4 
9 U is 1 9 


if re = 0 or 2 (mod 6), 
if re = 1 (mod 6), 
if re = 3 or 5 (mod 6), 
if n = 4 (mod 6), 


so that iS£(V) < L(4n — l)(n — l)/18j for arbitrary n. Since V was (up 
to our rotation) an arbitrary re-subset of C, this upper bound holds for 
S £ (n). □ 
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